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The N-terminal domain of the nucleocapsid protein from Middle East 
respiratory syndrome coronavirus (MERS-CoV NP-NTD) contains many 
positively charged residues and has been identified to be responsible for RNA 
binding during ribonucleocapsid formation by the virus. In this study, the 


domain; RNA binding. crystallization and crystallographic analysis of MERS-CoV NP-NTD (amino 
acids 39-165), with a molecular weight of 14.7 kDa, are reported. MERS-CoV 
NP-NTD was crystallized at 293 K using PEG 3350 as a precipitant and a 94.5% 
complete native data set was collected from a cooled crystal at 77 K to 2.63 A 
resolution with an overall Rinerge Of 9.6%. The crystals were monoclinic and 
belonged to space group P2,, with unit-cell parameters a = 35.60, b = 109.64, 
c= 91.99 A, B = 101.22°. The asymmetric unit contained four MERS-CoV NP- 


NTD molecules. 


Supporting information: this article has 
supporting information at journals.iucr.org/f 


1. Introduction 


Between 2003 and 2004, Severe acute respiratory syndrome 
coronavirus (SARS-CoV) caused a worldwide epidemic and 
had a significant economic impact in countries affected by 
the outbreak (Lai, 2003). In 2004, another alphacoronavirus, 
Human coronavirus NL63 (HCoV-NL63), was isolated from 
a seven-month-old child in the Netherlands suffering from 
bronchiolitis and conjunctivitis (Pyrc et al., 2004). In 2005, 
Woo and coworkers discovered the novel betacoronavirus 
Human coronavirus HKU1 in patients with respiratory-tract 
infections (Woo et al., 2005). Recently, Middle East respiratory 
syndrome coronavirus (MERS-CoV) has been found in 
patients with severe acute respiratory-tract infections in the 
Middle East (Woo et al., 2014). As is the case for all corona- 
viral infections, there are no efficacious therapies currently 
available against coronaviral diseases, making the develop- 
ment of anticoronaviral compounds a priority. The virion 
envelope surrounding the nucleocapsid contains the following 
structural proteins: S (spike), M (matrix), E (envelope) and N 
(nucleocapsid). Some of them have a third glycoprotein, HE 
(haemagglutinin-esterase), which is present in most alpha- 
coronaviruses. The primary function of the HCoV N protein is 
to recognize a stretch of RNA that serves as a packaging signal 
and leads to the formation of the ribonucleoprotein (RNP) 
complex during assembly (Chang et al., 2014). The RNP may 
be important in keeping the RNA in an ordered conformation 
suitable for replication and transcription of the viral genome 
(Lai, 2003; Nelson et al., 2000; Huang et al., 2004; Navas- 
Martin & Weiss, 2004). In addition, the N protein has been 
identified as an important diagnostic marker and the most 
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Table 1 


Macromolecule-production information. 


Source organism Middle East respiratory syndrome corona- 
VITUS 

The templates for the MERS-CoV N 
protein were provided by AIIBio Science 
Incorporation, Taichung, Taiwan. The 
primers for the MERS-CoV N protein 
were provided by Genomics BioSci & 
Tech Ltd, New Taipei City, Taiwan. 

CTTATCGCATATGAACACCGTGAGCTGGTATACC- 
GGC 

CTTACGGCTCGAGGGTGCCTTCAATATGAAAGTT- 
TTTCG 

pET-28a (Novagen) 

pET-28a (Novagen) 

Escherichia coli BL21 (DE3) pLysS 

HHHHHHSSGLVPRGSHMNTVSWYTGLTQHGKVPL- 
TFPPGQGVPLNANSTPAQNAGYWRRQDRKINT- 
GNGIKQLAPRWYFYYTGTGPEAALPFRAVKDG- 
IVWVHEDGATDAPSTFGTRNPNNDSAIVTQFA- 
PGTKLPKNFHIEGT 


DNA source 


Forward primer 


Reverse primer 


Cloning vector 

Expression vector 

Expression host 

Complete amino-acid sequence 
of the construct produced 


‘immunodominant’ antigen in infected hosts (Chan et al., 2005; 
Woo et al., 2004; Liang et al., 2013). 

The N protein of MERS-CoV, with a molecular weight of 
45.6 kDa and a pI of 10.05, is highly basic and hydrophilic 
(Woo et al., 2014). Previous studies revealed that the 
N-terminal domain of CoV (NP-NTD) contains mostly posi- 
tively charged residues, which are responsible for RNA 
binding, while the C-terminal domain (NP-CTD) mainly acts 
as an oligomerization module to form a capsid (Huang et al., 
2009; Saikatendu et al., 2007; Lo et al., 2013; Chen et al., 2013). 
The central disordered region of the N protein has also been 
shown to contain an RNA-binding region (Chang et al., 2009). 
We have shown that compounds that bind to the NP-NTD and 
interfere with NP—RNA interactions provide valuable leads 
for the development of anti-coronaviral therapeutics (Lin 
et al., 2014). The crystal structures of several NP-NTDs, 
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Figure 1 

SDS-PAGE analysis of MERS-CoV NP-NTD stained with Coomassie 
Brilliant Blue. Lane M, protein molecular-mass marker (labelled in kDa); 
lane 1, concentrated MERS-CoV NP-NTD after dialysis; lane 2, purified 
MERS-CoV NP-NTD. 
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including those from SARS-CoV, Infectious bronchitis virus 
(IBV), Human coronavirus OC43 (HCoV-OC43) and Mouse 
hepatitis virus (MHV), have been described (Chen et al., 2013; 
Ma et al., 2010; Saikatendu et al., 2007; Jayaram et al., 2006; Yu 
et al., 2006; Fan et al., 2005). In order to clarify the mechanism 
by which the N protein of MERS-CoV bind to nucleic acids, 
we have undertaken the determination of the crystal structure 
of the N-terminal domain of MERS-CoV spanning residues 
39-165, which shares 58% identity to NP-NTD of SARS-CoV. 
The results presented in this paper mainly concern the crys- 
tallization and preliminary X-ray structural analysis of MERS- 
CoV NP-NTD. 


2. Materials and methods 
2.1. Macromolecule production 


The templates for the MERS-CoV N protein were 
purchased from AIIBio Science Incorporated, Taichung, 
Taiwan. Truncated forms of recombinant MERS-CoV NP- 
NTD were generated by polymerase chain reaction (PCR) of a 
pGENT plasmid encoding the N-protein gene using different 
primers. The PCR products were digested with NdeI and 
XhoI, and the DNA fragments were cloned into pEI-28a 
(Novagen) using T4 ligase (NEB). Expression of the protein 
was initiated by adding IPTG to a final concentration of 1 mM 
followed by incubation at 10°C for 24 h. After harvesting the 
bacteria by centrifugation (8000 rev min~', 12 min, 4°C), the 
bacterial pellets were resuspended in lysis buffer (SO mM Tris— 
HCl, 150 mM NaCl, 15 mM imidazole, 1 mM PMSF pH 7.5) 
and lysed by sonication on ice using 3 s pulses with 6 s pauses 
for a total of 10 min. Soluble proteins were obtained from the 
supernatant after centrifugation (13 000 rev min ', 40 min, 
4°C). NP-NTD carrying an N-terminal His, tag fused to a 
SSGLVPRGSHM linker sequence was purified using an Ni-— 
NTA column (Novagen) and eluted with a buffered imidazole 
gradient of 15-250 mM. Fractions containing pure protein 


Figure 2 

Crystals of MERS-CoV NP-NTD obtained by the sitting-drop vapour- 
diffusion method. The largest crystals in the needle clusters are 
approximately 300 x 20 x 10 um in size. 
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were collected at ~150mM imidazole and were dialyzed 
against 50 mM Tris-buffered solution at pH 7.5 containing 
150 mM NaCl for 3h at 4°C (Fig. 1). The purified NP-NTD 
was further concentrated using Amicon Ultra centrifugal filter 
units and centrifuged at 3500g for 10 min at 4°C several times 
until the concentration of NP-NTD reached 10 mg ml‘ as 
determined by the Bradford method (BioShop Canada Inc.). 
Macromolecule-production information is summarized in 
Table 1. 


2.2. Crystallization 


Initial crystallization conditions were identified by using the 
sitting-drop vapour-diffusion method with crystal screening 
kits from Molecular Dimensions as described previously (Till 
et al., 2013; Chen et al., 2014). Each of the solutions (2 ul) from 
the crystal screening kits was mixed with 2 ul purified protein 
solution (10mg ml~') and allowed to equilibrate against 
300 ul solution in the well at room temperature (~25°C). The 
conditions were refined and crystals were grown from a well 
solution using the sitting-drop vapour-diffusion method by 
equilibrating a mixture of 2 tl protein solution (10 mg ml’) 
and 2 ul reservoir solution against 300 ul reservoir solution 
consisting of 2mM NaBr, 75mM ammonium sulfate, 29% 
PEG 3350 (Sigma). Crystals appeared within two weeks and 
the largest crystal in the needle clusters grew to dimensions of 
approximately 300 x 20 x 10 um (Fig. 2). Crystallization 
information is summarized in Table 2. 


2.3. Data collection and processing 


X-ray data were collected using synchrotron radiation with 
a cryStal-to-detector distance of 350 mm. The oscillation width 
and exposure time for each frame were 1° and 20s, respec- 
tively. Crystallographic data integration and reduction were 


Table 2 

Crystallization. 

Method Vapour diffusion 

Plate type 24-well sitting-drop plate (Hampton 

Research) 

Temperature (K) 293 

Protein concentration (mg ml7') 10 

Buffer composition of protein 50 mM Tris-HCl pH 7.5, 75 mM NaCl 
solution 


2mM NaBr, 75 mM ammonium sulfate, 
29% PEG 3350 

1:1; 2 pl reservoir solution was mixed 
with 2 pl purified protein solution 

Volume of reservoir (11) 300 


Composition of reservoir solution 


Volume and ratio of drop 


performed with the HKL-2000 program package (Otwinowski 
& Minor, 1997). The crystallographic data-collection statistics 
for NP-NTD are listed in Table 3. 


3. Results and discussion 


The MERS-CoV NP-NTD crystal chosen for this study 
diffracted to 2.63 A resolution (Fig. 3) and belonged to space 
eroup P2,, with unit-cell parameters a = 35.60, b = 109.64, 
c = 91.99 A, B = 101.22°. The Matthews coefficient of 
2.63 A? Da! calculated using MATTHEWS_COEF from 
CCP4 (Winn et al., 2011; Matthews, 1968) suggested that there 
were four molecules in an asymmetric unit with a solvent 
content of 59.2%. A homology search for the MERS-CoV 
NP-NTD structure was performed using the BLAST server 
(http://blast.ncbi.nlm.nih.gov/Blast.cgi). The model-selection 
criterion was based on the E-value and the estimated precision 
value. The sequence-alignment search indicated that MERS- 
CoV NP-NTD shares high identity with other NP-NTDs from 
coronaviruses (Supplementary Fig. S1). For example, the 
MERS-CoV NP-NTD (residues 39-165) has 58% sequence 


Figure 3 
Typical X-ray diffraction pattern of MERS-CoV NP-NTD. 
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Table 3 


Data collection and processing. 


Values in parentheses are for the outer shell. 


Diffraction source 


BL13B1 beamline, National Synchrotron 
Radiation Research Center (NSRRC), 
Hsinchu, Taiwan 


Wavelength (A) 1.000 
Temperature (K) 77 
Detector ADSC Q315r 
Crystal-to-detector distance (mm) 350 
Rotation range per image (°) 1 
Total rotation range (°) 360 
Exposure time per image (s) 20 
Space group P2i 
a, b, c (A) 35.60, 109.64, 91.99 
a, B, vy (°) 90, 101.22, 90 
Mosaicity (°) 1.053 
Resolution range (A) 30-2.63 (2.73-2.63) 
Total No. of reflections 138439 
No. of unique reflections 13974 
Completeness (%) 94.5 (91.4) 
Multiplicity 7.1 (5.9) 
(I/o(1)) 18.89 (3.03) 
Resie. 0.01 (0.08) 
Overall B factor from Wilson 43.1 

plot (A) 


identity to SARS-CoV NP-NTD. The NTD from SARS-CoV 
(PDB entry 2ofz; Saikatendu et al., 2007) was chosen as the 
initial model as its E-value was 1 x 10~7°, and a total of 116 
residues were modelled. The first molecular-replacement trial 
was performed using the PERON automated interface at the 
Protein Tectonics Platform (PTP), RIKEN SPring-8 Center, 
Japan (Sugahara et al., 2008). The core of the model consisted 
of a tightly packed 6-sheet surrounded by large loops. The 
molecular-replacement method was then applied to the model 
using MOLREP (Vagin & Teplyakov, 2010) using reflections 
in the resolution range 8.5-3.0 A. Single and unambiguous 
solutions for four NP-NTD molecules in one asymmetric unit 
were obtained in the rotation and translation functions, 
yielding a final correlation coefficient of 0.81 and an R factor 
of 0.32. Structural refinement of MERS-CoV NP-NTD is 
currently in progress. 
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